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ABSTRACT 

This self-instructional, job-oriented booklet on 
descriptive statistics for the health professions deals specifically 
with measures of central tendency (mean, median and mode) . It is 
limited to those concepts and techniques most needed by health 
professionals working routinely with basic statistical data. (CK) 
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SPECIFICATIONS 



Instructional Objectives 

After taking this Lesson as directed the student: 

1. Can verbally define Measure of Central Tendency, Arithmetic Mean, Median, and Mode. 

2. Can verbally describe when it is particularly appropriate to use the Arithmetic Mean, the Median, and the 

Mode. 

3. Can verbally describe the data needed to compute the Arithmetic Mean or the Median. 

4. Can verbally describe how an Arithmetic Mean or a Median may be used. 

5. Given actual or verbal description of situations and/or data, can name from memory the Measure of Central 
Tendency ( Arithmetic Mean, Median or Mode) most appropriate for use. 

6. Given data (raw or in tabular form), can match it with certain descriptive factors: continuous, discrete, 
N < 50, NJ> 50, and value range of> 14 or < 15. 

[See Limitations, Restrictions, and Special Characteristics below.] 



Primary Trainee Population 

Public Health nurses and sanitarians with college degrees or equivalent. 



Secondary Trainee Population 

A. Public Health veterinarians, physicians, dentists, and other similarly related Public Health workers with college 
degrees or the equivalent should also be able to use this Lesson; however, the examples used in this booklet will 
not be relevant to this grot p. 

B. With proper motivation and some additional effort, Public Health nurses and sanitarians with a high school 
education should also be able to use this Lesson . 



Student Study Time 

This Lesson should require from 2-4 hours, exclusive of breaks. We suggest that the student take a break at least 
every 1 to 1 Vi hours. The student should make every effort to complete the Lesson within a two-day period. 



Individualize tlon 

At least 20-25% of the frames may be skipped by the student, depending on his own needs. Of course, there is no 
time limit imposed-thc student may proceed at his own best rate. 



Limits tio ns, Res trie tions, and Special Charac ter! sties 

A. The verbal definitions required of the student (see Instmctional Objectives above) arc brief and nontechnical. 

B. The Lesson docs not teach the student the procedures and techniques for computing the Measures of Central 
Tendency presented. However, the student should be able to use the companion computational guides more 
efficiently (less time-fewer errors). 

C. For maximum effectiveness for both this Lesson and its companion computational guides, we suggest you follow 
the study of this Lesson with the use of the guides as soon as possible. 
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PREFACE 



In response to a general need voiced by students and teachers alike, we have developed a self-contained, job-oriented 
instructional package on Descriptive Statistics for the Health Professions. This is not meant to be an exhaustive 
treatment of statistics in general; it is limited, first, to descriptive statistics and, second, to those concepts and 
techniques most needed by health professionals working routinely with the basic statistical data. This attempt at job 
relatedness is also reflected in the post-instructional aims-we want the student to be able to put statistics to 
practical use, not converse in highly theoretical terms. 

Because we have sought operational relevancy and technical simplicity, two cautions art in order: 

(1) We have used health data in our examples in order to put the health professional in familiar surroundings. 
However, in our eagerness to keep the necessary basic math simple and the text unencumbered, we may 
have in places stretched the plausibility of certain health phenomena. Therefore, pL-ase don’t take offense 
but rather remember that the health data is not intended to be authentic, only familiar. 

(2) Also, in keeping with our simple, practical approach, highly complicated, technical concepts, definitions, 
and techniques have been avoided. Whenever this approach has conflicted with technical completeness, we 
have decided in favor of simplicity and practicality if technical accuracy is not violated. (Therefore, 
professional statisticians, please take note and do not hold your fellow professionals— our consulting 
statisticians-resonsiblc for any instructional liberties.) 

Descriptive Statistics for the Health Professions is concerned with only those statistics that are generally classified as 
descriptive statistics: 

(1) tables 

(2) graphs 

(3) descriptive ratios 

(4) measures of central tendency 

(5) measures of dispersion 

The present booklet is a programmed self-instructional Lesson on the selection and use of the appropriate measure of 
central tendency. The Lesson should be taken prior to (he use of its companion Guides, Arithmetic Mean: 
Computational Guide and Median: Computational Guide. A unique characteristic of this Lesson is that 
computational techniques, easily forgotten or made vague through disuse, are not taught. Such detailed techniques 
are covered in the Guides which are to be used when an actual need arises. Techniques are mentioned in the Lesson 
only as is necessary to make more meaningful the definitions of the specific measure of central tendency. 

We feel strongly that this Lesson, when properly used, should significantly reduce training time and costs, reduce the 
public health professional’s aversion to using statistics, and increase the effectiveness with which statistics are 
applied. 
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HOW TO USE THIS LESSON 

This Lesson is probably different from any you have ever used before-it is certainly 
different from the usual textbook or study manual. 

On almost every page of th e Lesson you will be asked to answer questions about what you 
are studying. Because of this “question and answer” way of teaching, many people 
confuse this type of lesson with a test. But it is n^t a test; it is a lesson that asks questions 
often and at the best times to help you learn. The questions help you think about what 
you are learning; the correct answers are given so that you can immediately see that you 
answered correctly and that you are learning. 

1 In the front of this booklet you saw a small strip of cardboard called the ANSWER 
COVER. If you have not already done so, remove i t...JVou\ place the ANSWER COVER 

over the gray part of the page on the right ► 

The ANSWER COVER should now cover the entire gray area so this page looks like the 
one pictured below; docs it? yes / no (Draw a circle around the correct answer.) 




You should have circled one of the “yes” or “no” answers with your pencil. If you did 
not, do so now... .That was an easy question to answer, but to see that you are correct 
move the ANSWER COVER do f vn the page until its top is even with the line below 



2 Have you read what is written in the gray area? If not, do so now. 

You see from what you just did that when a slash (/) is used to separate two or more 
words you must circle the correct answer. You may also be asked to check (\/) the right 
answer or write your answer in a blank. For example, answer the questions beiow: 

1. This is / is not a test. (Circle the correct answer.) 

2. Is this a lesson to help you learn? (Check the correct answer.) 

yes 

no 

3. This Lesson is part of course on Descriptive for the Health 

Professions. (Write your answer in the blank-.) 

Now see if you have given the correct answers ►- 




1 
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BEFORE YOU READ ANY FURTHER BE SURE TO COVER THE GRAY AREA 
WITH THE ANSWER COVER. DO THIS EACH TIME YOU START A NEW PAGE. 

3 You will notice that what you arc studying is divided into parts containing various 
amounts of information and questions. These parts arc called “frames.” 

Most frameshave (1) a certain amount of information and 

(2) questions for you to answer about the information in that frame or 
about other information you have studied before. 

Is what you arc now reading part of a frame? (Check one.) 

yes 

no 

Don’t forget to see if your answer is correct after you have written it; move the ANSWER 
COVER down to the next line 



4 You will be able to answer many of the questions correctly. However, when you arc 
wrong you should do which one of the following: 

1. Change your written answer; then go to the next frame. 

2. Try to to see why you were wrong; then change your written answer and 

go to the next frame. 

3. Go to the next frame. 

4. Start over again. 



5 The correct answer to the last question is very important. 

Just copying the correct answer when you are wrong will not help you learn as you 
should. 

Looking at the correct answer before you write your own answer to the question 
will not help you learn as you should. 

Copying your answers will make a difference only to you since it will keep you from 
learning as well as you might otherwise. This failure to learn the material will show up 
later in post-lesson testing or in on-the-job performance. 

To learn as you should you must: 

1. read everything carefully. 

2. follow instructions. 

3. write your answers before looking at the correct answers given. 

4. try to see why you were wrong-don’t just copy the correct answer when 

you make a mistake. 

5. take all the time you need-this lesson was written so that you can set 

your own pace. 
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INTRODUCTION 

6 Because portions of the next several pages appear in part in other booklets of the course 
on Descriptive Statistics for the Health Professions ; you may encounter some slight 
repetition if you have already studied one or more of the other booklets. However, we 
strongly recommend you give the entire Introduction your full attention. 



7 Professionals in Public Health frequently use statistical methods to describe or predict 
{infer). However, these two classifications of statistics-descriptive and inferential-arc not 
mutually exclusive; we must describe before we can infer. For example, descriptive 
statistics may be used to show that more men than women died from Disease “D,” but 
without inferential statistics we could not infer that there was a real rather than a chance 
difference between men and women with regard to Disease “D,” nor would we be able to 
predict that there would continue to be such a difference in the future. 

Of the five statistics we have classified as “descriptive,” this Lesson is concerned with 
(circle one) . . . 

1. tables 

2. graphs 

3. descriptive ratios 

4. measures of central tendency 

5. measures of dispersion 



MEASURES OF CENTRAL TENDENCY 
INTRODUCTION 



8 Before we go any further in our specific discussion of “measures of central tendency, 
let’s consider the basic working materials of descriptive statistics. 

Example: 

A frequency distribution based on labotatoty data from Warren County Hospital in 1966 
is represented in crude graph form below: 
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FREQUENCY 

OF 4 
LYMPHATIC 
LEUKEMIA 2 



0 



~T~ 

10 






~r 

20 



~T~ 

30 



~T~ 

40 



~ I 
50 



DURATION IN MONTHS 



Now see if you can answer the following questions without spending too much time on 
the ones you don’t know: 

1. Give a proper verbal description of the group being considered in the graph. 





2. What is the total frequency of the group? 

3. What is the factor (variable) being studied (allowed to vary)? 




4. What is being distributed? 

5. How many cases are there for each value as represented in the graph? 



6. Could male cases be included in the group? 

yes 

no 

7. Could cases discovered in 1967 whUe the report was being prepared be included? 

yes 

no 



Check your answers. 

If you answered all the questions correctly, skip to Frame 26. 

If you could not answer all the questions, go on to the next frame. 




y. 
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9 Descriptive statistics may be thought of as a way of describing, in numerical terms, 
something about GROUPS of “cases” (people or events) having common characteristics. 
That is, all cases of the groups arc matched (identical) with regard to certain 
characteristics. For example . . . 



THIS GROUP 




Cases of laboratory-cpn- 
firmed canicola fever fn 
Columbus, Georgia, 1962. 



HAS THESE CHARACTERISTICS IN COMMON 
FOR ALL ITS CASES 




(1) all were diagnose d-laboratory- 
confirmed-as canicola fever 

(2) all occurred in Columbus, Georgia 

(3) all occurred during 1962 



Although the common characteristics that are made explicit restrict and control the 
group, certain other characteristics may be true of the group and may be allowed to vary. 
For example, check (\/) the cases below that may be included in the group described 
above: 



cases not laboratory confirmed 

cases laboratory confirmed 

male cases 

female cases 

cases in Columbus, Ga. 

cases not in Columbus, Ga. 

cases over 21 years old 



cases under 21 years old 
cases in high socioeconomic 
setting 

cases in low socioeconomic 
setting 

cases not in 1962 
cases in 1962 



10 Sometimes characteristics that apply to only part of a group arc used in a statement 
about the group. For example . . . 



READ THIS STATEMENT 




Of a group of 185 1 1-year- 
old boys, many of whom 
weigh about 60 pounds, 
each is 56 inches tall, 
three-fourths are in the 
5th grade, and the rest are 
in other grades. 



AND LIST ONLY THE CHARACTERISTICS 
ALL MEMBERS OF THE GROUP HAVE IN COMMON 




All 

All 

All 



i-i 
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11 Sometimes we tend to think of the size (count) of a group as a common characteristic. 
This is obviously not correct, as you can easily see by trying to describe just one case in 
the group in Frame 10; for example, “The case must be a boy, 1 1 years old, 56 inches tall 
(but certainly not € 1 85’)/^ In descriptive statistics the terms “total frequency” or 

“number” are usually used to refer to the of a 

group. 



12 We have already pointed out that for any group, many characteristics are not held 
constant but rather are allowed to vary. In fact, a group is often defined in order to see 
how certain characteristics VARY with respect to the group. In the statement below . . . 

“Of 185 1 1-year-old boys, each is 56 inches tall, many weigh about 60 pounds, 3/4 s 
are in the 5th grade, and the rest are in the other grades.” 

We can see that for the group implicit in the statement, weight is certainly not a common 
characteristic that is being held constant. If we wish to study more exactly how weight 
varies, we might formalize the statement thusly: 

“A group of 185 56-inch-tall, 1 1-year-old boys, by weight in pounds .” 

Now, you list the following about the above proposed study . . . 

THESE ARE THE COMMON 

CHARACTERISTICS THAT AND THIS IS THE COMMON CHARACTERISTIC 



DEFINE THE GROUP THAT WILL VARY 




The number 185 is the for 

the group. 
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13 In most instances the characteristic that is being studied (allowed to vary) is preceded by 
a certain preposition; circle this preposition in the following observation . . . 

“Twenty-five cases of laboratory-confirmed canicola fever, by age, in Columbus, 
Georgia, 1962. 

14 In a study, the common characteristics of a group are held constant while one or more 

common characteristics are allowed to 

15 The common characteristic that is allowed to vary may be referred to as the 

study able. 



16 A number of cases having certain constant common characteristics is called a . 

Its size or count is called the • 

The common characteristic that is allowed to vary is called the study 



tion 



and is usually preceded by the preposi- 

in a formal statement of the study. 
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17 In the two observations below . . . 

underline the common characteristics of the group 
double underline the total frequency of the group 
( circle ) the study variable 
Example 1 

“Distribution of 25 males, ages 25-50 years, by grams (g.) of hemoglobin per 100 
milliliters (ml.) of blood, Washoo County, 1960. 

Example 2 

“Distribution of 25 males with 16-17 grams (g.)of hemoglobin per 100 milliliters (ml.) 
of blood, by age, Washoo County, 1960. 



18 In only one of the two examples above is the unit by which the study .variable is to 
measured made explicit. This is not unusual when such information is assumed to 
implicit in the name of the variable itself, i.e., in Example 2 the missing unit of measur 
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19 We are now ready to discuss “frequency distributions.” Notice that the two statements in 
the last frame begin with the word “distribution.” Actually, this is a standard way of 
saying that we are going to look at the particular way in which individual frequencies of 
the group are distributed among the various values of the study variable. A frequency 
distribution is often represented graphically. We see this in oversimplified form as 
follows: 

“Distribution of 10 boys with measles by age ” 

Each dot represents a boy with 
measles; there are 10 boys (total 




In the above representation we can see that the individual frequencies (totaling 10) are 
distributed among the various values of the study variable so that no age value is 

represented by more than case(s) of measles. 



20 Often, individual frequencies are distributed so that a particular age value is represented 
by more than one case. For example . . . 



Dots represent the 

frequency 

distributed among 
the values of the 
study variable — ► 



6 yrs. 



• • 

• • • 

• • • 



7 yrs. 8 yrs. 9 yrs. 






10 yrs. 



According to the frequency distribution, how many cases are 

6 years old? 8 years old? 

7 years old? 9 years old? 

10 years old ? 



A - 




9 



MEASURES OF CENTRAL TENDENCY 
INTRODUCTION 



21 The relationship of the frequency of a group to the various values of the study variable is 
referred to as the ’ 



22 In the last several frames we have referred to frequency distributions used in examples by 

a single word, as in “ of 185 1 • -year-old boys by 

weight . . . 



23 You should now be able to answer all the questions about the following distribution 
based on laboratory data from the Lowin County Clinic in 1964: 



FREQUENCY 

OF 

UTERINE 

FIBROIDS 
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1. What is a proper verbal description of the group being considered in the crude graph? 



2. What is the total frequency for the group? 

3. What is the study variable? 

4. What is being distributed? 

5. How many cases are there for each value (weight)? 

6. Could cases discovered in 1965 while the report was being prepared be included? 

yes 

no 



1 ;' 
I * 
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24 If you made no errors in the last frame, go directly to Frame 26. 

If you made errors, study for a moment why and then go to the next frame. 

25 A frequency distribution based on laboratory data from Warren County Hospital in 1966 
is represented in crude graph form below. 
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1 . Describe the group. 



2. What is the group’s total frequency? 

3. What is the study variable? 

4. What is being distributed? 



5. How many cases are there for each value? 



6. Is the group restricted by sex? 

yes 

no 

7. Is the group restricted by time of occurrence? 

yes 

no 



It 
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A MEASURE OF CENTRAL TENDENCY 

26 If this lesson had been called “averages” rather than “Measures of Central Tendency,” 
more people would know from its title what is being taught. How do you now define 
“Measures of Central Tendency”-write your definition below: 



27 Does your definition match in meaning, if not stated word-for-word, the one below: 

“A measure of central tendency is a value that is used to represent the center of a 
distribution of values. It is considered to be a representative value which can be used in 
place of numerous individual values.” 

If your definition matches the one above, go now directly to Frame 33. 

Or, if not, does your definition match the one below better: 

“A value obtained by adding all the individual values of a distribution and dividing by 
the number of values. It is considered to be a representative value which can be used in 
place of numerous individual values.” 

If so, go to the next frame. 



28 The second definition given in the last frame is probably what most people think of as a 
measure of central tendency or average. However, it more appropriately identifies a 
particular type of average known technically as the “arithmetic mean.” Therefore, your 
answer, though not correct, is not wholly incorrect-just too specific as you will see as 
you continue reading. 
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29 



The definition for measure of central tendency (average) we expect you to learn is: “A 
value that is used to represent the center of a distribution of values. It is considered to be 
a representative value which can be used in place of numerous individual values. 

Although the definition you used originally may suggest the same meaning as the one 
above, a key word that states the function of a Measure of Central Tendency and tha 

should be used in its definition is — — * 



30 The average may be though ; of as representing (a) the 



of the 



distribution and (b) the individual 



of the distribution. 



31 A definition of “Measure of Central Tendency’’ must indicate that its function is to 

^ of a distribution 



and that it may be used to 
values of the 



the 






32 See if you can now state from memory the definition of Measure of Central Tendency we 
will use in this Lesson: 



& 
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SPECIFIC MEASURES OF CENTRAL TENDENCY 

33 Ws will now discuss the three most commonly used Measures of Central Tendency: the 
arifhmetic mean, the median, and the mode. 
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SPECIFIC MEASURES OF CENTRAL TENDENCY: MODE 

34 In this Lesson we are defining mode as: “A Measure of Central Tendency which, for any 
list of values, is the single value or group of values which occurs most often.” The mode 
in each of the two simple distributions shown graphically below is the number of 
injections occurring most often. What are the modes in the two distributions: 

1. MODE: 




2. MODE: 




NUMBER OF INJECTIONS 



16 
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3 



35 In part 1. of the last frame six injections was the mode because it was the 

_ value that occurred most often; in part 2., 1-2 injections was 

the mode because it was the 

that occurred most often. 



! 
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36 What is the mode for each of the worktables shown below: 
1. MODE: 



2. MODE: 



WORKTABLE: Patients Dying From Heart Rupture By Age In Years, Los 
Angeles County Hospital, July 1941 -Oct 195V 



Age in 
Years* 


Number 

of 

Patients 


50-54 


2 


55-59 


5 


6a69 


27 


70-79 


33 


80-89 


13 


80 


•Age at last birthday 




WORKTABLE: Distribution 


of 75 Restaurants By Number Of 


Inspections During The Year, Center County, 1965 


Number 


Number 


of 


of 


Inspections 


Restaurants 


2 


6 


4 


12 


6 


22 


8 


19 


10 


11 


12 


4 


74 
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37 In part 2. of the last frame, six inspections (the mode) was the 

that occurred 



; in part 1., 70-79 



years was the 



that occurred 



38 Now state from memory the definition of mode we are using in this Lesson: 



39 Before you continue, see if you can recall from memory the definition of A Measure of 
Central Tendency ; think your answer. 



19 







MEASURES OF CENTRAL TENDENCY 



SPECIFIC MEASURES OF CENTRAL TENDENCY: ARITHMETIC MEAN 

40 In this Lesson we are defining “arithmetic mean” (or simply “mean”) as: “A Measure of 
Central Tendency obtained by adding all the individual values and dividing by the number 
of values.” Actually when most people use the word “average” (a term roughly 
synonymous with Measure of Central Tendency), they are talking about the mean. 

The college student who computes his quality-point “average” is actually finding his 
mean quality point value. To find the mean he . . . 

1. lists the quality points he has received for each hour of credit, 

2. adds up the list of values (quality points), and 

3. divides the total value by the total number of credit hours. 

The ages of a group of children who have the measles are 1 , 3, 7, 9, 9, 12 , and 15. 

What is their mean age? 



41 To find the mean age in the last frame . . . 

1. list all ages 1 

3 

7 

9 

9 

12 

15 

2. add all values 56 g y ears j s g r0 up’s mean age 

3. divide by number of children 7)56 



42 Recall (and write) from memory the definition of the mean: 



0 



”20 



MEASURES OF CENTRAL TENDENCY 
SPECIFIC MEASURES OF CENTRAL TENDENCY: ARITHMETIC MEAN 



43 The mean is often referred to as a “weighted” average since the size (weight) of each 
individual value is mathematically reflected in the mean value for the group. Because the 
mean is so mathematically sensitive to the size of all the individual cases, atypical values- 

extremely high or low-will tend to bias the mean in their 



44 In the following distribution of ages . . . 

5, 6, 20, 22, 23, 24, 26, 27, 30, 3 1 , 32, 34 
the mean will be: (check one or more) 

(a) lower than it should be to represent the distribution “operationally” 

(b) higher than it should be to represent the distribution “operationally” 

(c) a mathematically correct Measure of Central Tendency 

(d) none of the above 
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MEASURES OF CENTRAL TENDENCY 



SPECIFIC MEASURES OF CENTRAL TENDENCY: MEDIAN 

45 In this Lesson we are defining “median” as: “A Measure of Central Tendency which, for 
any distribution of values ranked from smallest to largest, is above one half and below the 
other half of the values.” 

Below, seven cases have been ranked in order from youngest to oldest. The middle 
position is occupied by a case whose value is 1 1 years: 



MIDDLE 

POSITION 



3 



3 



Cases 



Cases 



Cases 




11 




IN YEARS 



♦ 



MEDIAN age = 11 years 



The seven cases in the distribution were ranked in order from _ 
to • 

Is there an atypical age among the ranked cases-if so, what is it? 
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I MEASURES OF CENTRAL TENDENCY 

| SPECIFIC MEASURES OF CENTRAL TENDENCY: MEDIAN 

46 The great advantage of the median as a Measure of Central Tendency is that it is not a 
“weighted” average as is the mean. Therefore, it is not affected by the extreme value 
(size) of any case in the distribution. Values are used only to assign the rank position to 
j the cases; the value of the case in the middle position is the median value. 

! A group of seven cases by weight is shown below . . . 
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1 
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A 


X 


JC 


J C_ 


Now, you rearrange the cases in rank order by filling in the values below: 












1 1 — 1 LI 






Draw an arrow to the middle position in the rank order ... therefore, what is the 
median? 

Are there any atypical values (cases) in the distribution? 

yes 

no 

If so, list the value(s). 

Was the selection of the median affected by the atypical size of the case value( s )? 
yes 
no 






23 2S 

ER?C 

.HEEfl 







MEASURES OF CENTRAL TENDENCY 

SPECIFIC MEASURES OF CENTRAL TENDENCY: MEDIAN 

47 Recall and write from memory the definition of the median: 



48 From memory, recall the definition of mean', think your answer. 



49 From memory, recall the definition of mode; think your answer. 



50 From memory, recall the definition of Measure of Central Tendency; think your answer. 
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MEASURES OF CENTRAL TENDENCY 



MEAN AND MEDIAN COMPARED 

51 Complete the two statements below by filling in the letter a or b: 

1. The mean a. is more mathematically sensitive to the sizes 

(weights) of the values of a distribution. 

2. The median b. is sensitive to the size of the values of a 

distribution only to the extent that they affect the 
ranked position of the values. 



MEASURES OF CENTRAL TENDENCY 
MEAN AND MEDIAN COMPARED 



52 Let’s see more specifically how the size (weight) of the individual values affects the mean 
and median in the following distribution. 



The values in this list 
are in a random order 
with no regard to rank. — 



_ — The same values are 
listed below in rank 
order-from smallest 
to largest. 



4 

6 

2 

8 

7 

6 

3 

4 

5 




What is the mean value of 
the list of values-use the 

column on the left 

(mean) 



What is the median value of 
the list of values-use the 

column on the right 

(median) 



► 



2 

3 

4 

4 

5 

6 
6 

7 

8 



53 You should notice that the 
mean value of 5 could be 
determined by simply adding 
the random list of values and 
dividing this total (45) by / 
the total number of values (9). 



























However, to determine the 
median of 5, it was necessary 
to rank the values from small- 
est to largest to find which 
value represents the middle 
position below one half the 
values and above the other half. 



54 For the values in Frame 52, both mean and median are 5. What if the value 8 were 
changed to 17-would the mean and median still be equal? 

17 



4, 6, 2, *, 7, 6, 3, 4, 5 



yes 

.no 

don’t know 
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No, the new mean and median are not equal when the value 8 is changed to 17. Let s see 
what it would be-we will demonstrate with only the list that is ranked even though it is 
not needed for the mean . . . 



We see that 5 still occupies the 
middle position in the ranked data 
and, therefore, is still the 
median value 

However, the change in size (weight) 
of one of the values has such a direct 

effect on the mean that it is changed 



2 

3 

4 

► MEDIAN 5 

^POSITION 

6 1 

6 

7 

$ 17 

^54T? = 6 MEAN 



We see that although the mean and median for a distribution may at times be the 
same, the median is affected simply by relative position of values, whereas, the mean is 
much more sensitive to individual size (weight) of values. Because of the greater 
mathematical sensitivity of the mean, we pointed out earlier how it is “biased” 
(“weighted”) in the direction of an extreme atypical value— did this happen in the 
example above? 

yes 

no 

don’t know 
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MEAN AND MEDIAN COMPARED 



s 






57 Yes, by increasing the 8 to 17, an atypical value was introduced to the list and the mean 
was weighted unrealistically in the direction of the larger values. Study the illustration 
below: 



MIDDLE POSITION: MEDIAN 



2 

3 

4 
4 

► 5 
6 
6 
7 
17 



Most values are under 10 



54-r 9 = 6 MEAN 



We see that the median value of 5 is more representative of most values than is the mean 
(6) which increased in order to take into account the extreme value 17. 
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MEAN AND MEDIAN COMPARED 
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58 A list of ages can also be modified by using open-ended intervals. An open-ended interval 
is an interval in which only one limit, or “end,” is known. Both “>55” (greater than 55) 

and “<8” (less than 8) are intervals. 



59 Now let’s add an open-ended interval to our list of ages to see the effect it has on the 
mean and median: 

2 

3 

4 

4 

5 

6 

6 and two values >6 (greater than 6) 

Can the median be computed in the above example-check and complete the answer 
below: 



yes, the median value is 
no, because 



don’t know 



60 Yes, the median can be computed even when the data contain an open-ended interval— if 
you know the frequency involved, and if the median does not fall in the open-ended 
interval. 

In the above example we know that there are 9 values in all (the total frequency or 
number), we can make an approximate ranking to find the value of the middle position, 
and we know from what’s given that the middle position does not fall in the open-ended 
interval. With this in mind, what is the median for the distribution below: 

46 , 40, 40,48,47,45,44,43,41,42,39, 42,45,49,42, 2 values under 39, and 4 values 
over 50. 



MEDIAN is 

HINT: Rank your values and then find the middle position. 
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MEASURES OF CENTRAL TENDENCY 
MEAN AND MEDIAN COMPARED 



61 When all specific values of a distribution are not known because there is an open-ended 
interval, for example . . . 

2 

3 

4 

4 

5 

6 

6 and two values >6 

can the mean be computed; check and complete the answer below: 

yes, the mean value is 

no, because 



don’t know 



62 The mean is found by dividing the sum of the values by the number (total frequency) of 
values. In the example above we know the total frequency (9) but we cannot determine 
the sum of values because we do not know the two values in the open-ended interval. 



63 A comparison of the mean and median indicates the following characteristics 
( circle the correct answer ) : 

I Extremely high or low (atypical) values in a distribution will unrealistically (and 
impractically) bias the mean / median in their direction. 

2. When a distribution contains an open-ended.inteival, only the mean / median 
can be computed. 

3. With respect to atypical values and open-ended intervals, the mode is most like the 

mean /median . 
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64 Is it enough to say that all that is needed to compute the mean, median, or mode is the 
total number of values in a distribution and a listing of each value that occurs in that 
distribution? 

yes 

no 

don’t know 



65 No, we had hoped you would know that such an unqualified statement is not valid. 
Although the median can sometimes be computed when all values are not known, the 
mean requires that all values and their frequencies be reported so that we can compute the 
sum of all values and the total number of values. 



MEASURES OF CENTRAL TENDENCY 
MEAN AND MEDIAN COMPARED 



66 We have already touched on how the completeness of your data affects the computability 
of the Measure of Central Tendency. More explicitly, to compute the mean we use a list 
of all the individual values or a table in which all the values are listed according to their 
frequency of occurrence— no open-ended intervals are permissible since we need to find the 
sum of all values and the total number of values. Can the mean number of visits be 
computed for the following examples: 

1 . Thirty patients visited a clinic 2, 3, 4, 5, 6, 8, 10, or 20 times apiece during 

its first quarter of operation. 

2. During its first quarter of operation a clinic had patients make the following 

number of repeated visits: 2, 4, 8, 10, 6, 5, 4, 3, 2, 4, 3, 2, 6, 8, 8, 10, 20, 
20, 2,3,4, 6, 5, 3, 4, 5,4, 3, 2, 8. 



67 We have said that to compute the mean we use a list of all the individual values In 

example one above we have the total number of cases (30) given, but only the types of 
values indicated. What is needed is how often 2 visits were made, how often 3 visits were 
made, etc. 

In example two above we are given a list with each individual value listed as often as it 
occurs. With this information we can find the sum of the visits and the number of cases 
involved. 



68 To compute the MEAN we use a list of all the 

or a in which the values are 

listed according to their occurrence-no 

open-ended intervals are permissible. 
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69 Can the mean be computed for any of the examples to follow: 

1. Distribution of ages of patients at a special clinic during January: 46, 40, 

40,48,47,45, 44, 43, 41, 42, 39, 42, 45, 49, 42, 2 ages under 39, and 4 
ages over 50. 



. 2 . 

WORKTABLE: Distribution Of 175 Preschool Children By Number Of 
Immunizations, Center County, 1963 



Number of 


Number of 


Immunizations 


Children 


1 


15 


3 


23 


4 


40 


6 


3B 


7 


30 


B 


29 


Total 


N* - 175 



*N is a symbol used for /Vumber (total frequency) 



3. 

WORKTABLE: Distribution Of Cases Of Poliomyelitis By Age In Years, 
Center County, January-June, 1962 





Age in 


Cases of 




Years 


Poliomyelitis 




Under 4 


25 




5 


10 




6 


12 




7 


9 




B 


16 




9 


13 




10 


14 




Total 


N -99 
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69 (continued) 



4 . 



WORKTABLE: Number Of Longshoremen Covered By Medical Care Plan 
For 12 Months, By Age In Years On July 1, 1955, Stockton, Calif., July 
1955-June 1956 



Ago in 


Number of 


Years 


Longshoremen 


16-19 


5 


20*24 


22 


25-29 


47 


30-34 


43 


35*39 


55 


4044 


91 


45*49 


78 


50*54 


68 


55*59 


34 


60*64 


30 


65*69 


16 


> 69 


3 


Total 


N - 492 



5. 



WORKTABLE: Distribution Of 100 One-Year-Old Babies, By Weight In 
Pounds, Center County Baby Clinic, 1965 



Weltftt in 


One-year- 


Pounds 


Old Babies 


20*21 


4 


22-23 


15 


24-25 


31 


26-27 


35 


28-29 


12 


30*31 


3 


Total 


N - 100 
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70 The mean can be computed from data in list or in tabular form-but no open-ended 
intervals are permissible . 

In example one on page 34, though you can determine that 22 patients are involved, the 
sum of ages cannot be found when you arc not given the particular ages less than 39 and 
more than 50. 

In example two, the number of cases is 175, and you are given all individual values even 
though they are grouped; therefore, the sum of values can be found. 

In example three, the particular values under 4 years are not given. 

In example four, the particular values over 69 years are not given. 

In example five, enough information is given so that you can find the midpoint values to 
represent all individual values~this will allow the mean to be computed. 

We see that the mean could not be computed for examples 1, 3, and 4 because the data 

given contained intervals. 



71 To compute the MEAN we use 
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72 The data used to compute the median is similar to that used to compute the mean. 
However, does the median require that no open-ended intervals be present? 

no 

yes 

don’t know 



73 Recall that the median is that value occupying the “middle position” among values 
ranked from smallest to largest. For example, consider the ranked data below: 

3 values < 2, 2, 2, 3, 4, 4, 4, 5, 6, 6, 7, 8, 9, 4 >9 

Although both ends of the ranked values are “open” we know that there is a total of 19 

values and that the middle position is occupied by . 



74 We see from the last example that open-ended data does not prevent us from finding the 
median value as long as the inclusive individual frequencies of the open-ended intervals 
are given, and the median does not fall in an open-ended interval. 



75 To compute the mean or median we use a list of the 

values or a in which the values are listed according to 

their of occurrence. To compute the mean the 



data may contain 



pute the median the data may contain 



intervals. To corn- 
intervals if 



inclusive frequencies of open-ended intervals are given and the median does not fall in an 
interval. 
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76 In addition to open-ended intervals, data may also contain missing values, for example: 

2 

3 

4 

5 

6 

and 2 unknown values 

Can the mean be computed in the above example-check and complete the answer below: 
yes, the mean value is 

no, because 



don’t know 



77 The mean is found by dividing the sum of the values by the number (total frequency) of 
values. In the example above we know the total frequency (7) but we cannot determine 
the sum of values because we do not know the missing (unknown) values. 
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78 When the data contains missing values, for example . . . 

2 

3 

4 

5 

6 

and 2 unknown values 

can the median be computed; check and complete the answer below: 
yes, the median value is 

no, because 



don’t know 



79 The median cannot be computed when the data contains missing values, even if we know 
the frequency. In the above example we do not know what position the unknown values 
would occupy in the ranking; therefore, we cannot find the middle position, or median. 
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80 Place the appropriate letter beside the descriptions of data below to show that it can 
be used to compute . . . 

a. the mean 

b. the median 

c. either 

d. neither 

1. Distribution of ages of patients at a special clinic during January: 46, 40, 40, 

48, 47, 45, 44, 43 , 41,42, 39, 42, 45, 49, 42, 2 ages under 39, and 4 ages 
over 50. 



2 . 



WORKTABLE: Distribution Of Intensive Cere Patients By A®e In Years, 
General Hospital, 1960 



Ago in 
Yean 


Number of 
Patients 


1 1*20 


5 


21*30 


9 


31-40 


20 


41*50 


41 


51-60 


39 


> 60 


143 


Total 


N - 257 



3. 



WORKTABLE: Distribution Of 176 Preschool Children By Number Of Im- 
munizations, Center County, 1963 



Number of 
Immunizations 


Number of 
Children 


1 


15 


3 


23 


4 


40 


6 


38 


7 


30 


8 


29 


Total 


N - 175 
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WORKTABLE: Distribution Of Cases Of Poliomyelitis By Age In Years, 
Center County, January*June, 1962 


Age in 


Cases of 


Years 


Poliomyelitis 


Under 4 


25 


5 


10 


6 


12 


7 


9 


8 


16 


9 


13 


10 


14 


Total 


N « 99 
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80 (continued) 



5. 



WORKTABLE: Number of Longdtofemen Covered By Medical Cere Plan 
for 12 Month*. By Age In Yew* On July 1. 1955. Stockton. Celif.. July 
1955-June 1956 



Age in 
Years 




Number of 
Longshoremen 


16-19 




. 5 


20-24 




22 


25-29 




47 


30-34 




43 


35-39 




55 


40-44 




91 


45-49 




78 


50-54 




68 


55-59 




34 


60-64 




30 


6569 




16 


Unknown 




3 


Total 




N « 492 



6 . 

WORKTABLE: Distribution Of 100 One-Year-Old Babies. By Weight In 
Pounds, Center County Baby Clinic. 1965 



Weight in 
Pounds 


One-Year- 
Old Babies 


20-21 


4 


22-23 


15 


24-25 


31 


26-27 


35 


28-29 


12 


30-31 


3 


Total 


N - 100 
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80 (continued) 
7. 



WORKTABl.E: Distribution of Cases of Disbetes By Age In Yean, Center 
County, 1965 



Age in 


Number of 


Years 


Cases 


5*24 


15 


25*44 


44 


*5*54 


61 


55-64 


79 


6594 


97 


> 94 


Unknown 



81 In example one, because the data is open at each end, you cannot compute the mean; 
however, because the frequencies of the open-ended intervals are given, we will be able to 
find the median. 

In example two, the number of patients > 60 years old is more than half of the total 
number of patients; therefore, the median would fall within the open-ended interval and 
neither mean nor median can be computed. 

In example three, all values and their frequency arc given; therefore, cither mean or 
median can be computed 

In example four, the open-ended interval (under 4) allows for the computation of only 
the median. 

In example five, the missing values (Unknown) allow for the computation of neither the 
median nor the mean. 

In example six, all values and their frequency are given; therefore, cither mean or median 
can be computed. 

In example seven, the frequency for the open-ended interval (> 94) is unknown; 
therefore, neither the mean nor the median can be computed. 

We can see from the above examples that the mean / median can be computed anytime 
the mean / median can, but not the reverse. 
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What is the similarity and the difference in data that can be used to compute the mean 
and the median? 



83 From memory, define Measure of Central Tendency (think your answer): 



84 From memory, define mean (think your answer): 



85 From memory, define median (think your answer): 



A 



* 



3 
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86 From memory, define mode (think your answer): 



87 On which of the Measures of Centra) Tendency will extreme, atypical values have the 

most undesirable effect? 

What is that effect? (Think your answer.) 



88 Which Measure of Central Tendency cannot be computed from data having "open-ended 
intervals?” 
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MEETING THE CONDITIONS FOR USE 

89 If the median is used as the reporting statistic in a major reference paper on family 
income and health, the Measure of Central Tendency to use for any subsequent related 

reporting should be the . 



90 The particular Measure of Central Tendency you use should be the one generally accepted 
for the data or the situation to which it is applied. This b true in order toensure 



46 



50 



MEASURES OF CENTRAL TENDENCY 



MEAN 

91 The mean should ahvuyt be the Measure of Central Tendency of choice when not 
prohibited by the characteristics of the data or the eventual use of the statistics. List the 
three conditions which prohibit the use of the mean: 

I. - 



2 . 



3. 



92 In any particular situation the mean may be the Measure of Central Tendency of choice 

except that it is not the one for the situation or 

the data;in this instance we might have to forego certain particular preferences in order to 

achieve 

in our Measure of Central Tendency. 



93 In any particular situation the mean may be the Measure of Central Tendency of choice 

except that the data contains 

large or small atypical values. In 

this instance the mean would not be used because it would be biased unrealistically toward 
the values. 
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MEASURES OF CENTRAL TENDENCY 
MEAN 

94 In any particular situation the mean may be the Measure of Central Tendency of choice ex* 

cept that the data contain intervals. This condition would 

prevent you from being able to compute the sum of needed. 



95 Unless prohibited by the characteristics of the or the 

eventual of the statistics, the is always the 

Measure of Central Tendency of choice. 
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MEDIAN 

96 The fust condition you should consider when you are evaluating the merits of using the 
me dian is the same as that for any other Measure of Central Tendency; namely, 

I — 



The other two conditions that favor the use of the median are those that prohibit the use 
of the mean; namely, 

2. — 



3. 



97 I. The median is used when it is the Measure of Central Tendency generally accepted 
for the data or the situation involved. 

2. The median is used when the data contain extremely large or extremely small 
atypical values. 

3. The median is used when the data contain open-ended intervals. 

The median is probably used most often when the first choice Measure of Central 
Tendency cannot be used. Therefore, if you remember the specific conditions that 

prohibit the use of the you will know when to use the median. 
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MEASURES OF CENTRAL TENDENCY 
MODE 

98 We cannot compute either the mean or the median when the data contain m/m v ; values, 
as in this list of ages: 

2 

3 

3 

3 

4 

and I unknown vat le. 

However, can we compute th.* mode for the above example? 

yes, the mode value is 

no, because - 



o 

ERIC 
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99 We can compute the mode in the preceding frame because we can see that 3 is the most 
frequently occurring value. However, sometimes missing values prevent the use of the 
mode. Look at the following example: 



Here it is possible that the 20 “unknown’* values could all be the same value and that this 
value could be 4 visits. Since we cannot prove that this b not the case, we add the 
frequency of missing values to the second highest frequency of a known value: 



REMEMBER: When the data contain missing values, find the sum of the frequency of 
missing values and the second highest frequency of a known value. If this sum is greater 
than the highest frequency of a known value, the mode cannot be computed. 

The new frequency of SO b greater than the frequency of 40 for the known value which 

occurs most often. Therefore, we can / cannot compute the mode. 



50 



NUMBER 40 



41 



OF 30 
CHILDREN 

20 




31 




10 



UlkMWI 



0 



12 3 4 

NUMBER OF CUNIC VISITS 



40 

NUMBER 



OF 30 



50 




The possible 
frequency of 4 
visits is now: 



ii 



CHILDREN 20 



21 




I 2 3 4 

NUMBER OF CLINIC VISITS 
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100 Although the mode can be used with atypical values, open-ended intervals, and many 
eases of missing values, it is still not generally used unless: 

I. 



(Hint: This is the one common for all.) 

2 . 



(Hint: This condition has to do with the definition of the mode.) 



101 First, the mode is used when it is the Measure of Central Tendency generally accepted Tor 
the data or the situation involved. Second, the mode is not necessarily prohibited if the 
data have atypical or missing values or open-ended intervals— it can be used. However, the 
mode is still not usually used unless the investigator has a particular interest in the 

most valuc(s). 



MEASURES OF CENTRAL TENDENCY 

MODE 



102 Write the namc(s) of the particular Measure of Ccnltal Tendency to which each of the 
conditions below applies: 

1. Always the Measure of Central Tendency of choice unless prohibited by 

specific circumstances. 

2. The Measure of Central Tendency generally accepted for the data or 

situation involved. 

3. The data contain extremely large or extremely small atypical values. 

4. The data contain open-ended intervals. 

5. The interest of the investigator is in the most frequently occurring values. 

6. Conditions 3 or 4 and the decision to use the Measure of Central 

Tendency of “second choice.” 



103 From memory, recall the conditions for using the mean: (Think your answer.) 



104 From memory, recall the conditions for using the median: (Think your answer.) 
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105 From memory, recall the conditions for using the mode: (Think your answer.) 
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THE MEASURES OF CENTRAL TENDENCY 

From our earlier discussion you can see that “representativeness” and “comparability” 
are important characteristics when considering a Measure of Central Tendency. To answer 
the question, “How can a mean or median be used?”, we simply expand the two 
characteristics as follows: 

A mean or median can be used to all values of its 

distribution, they can be to the mean or median 

for other distributions, and/or they can be used as a “normal” value against which 

individual values of their distribution can be • 



A sanitarian reporting on his activities for the past 6 months says that he “averaged” 3.5 
inspections per day. How is he using the Measure of Central Tendency? 



The median income of Solka City West Clinic patients is $3300 per year; the median 
income of Solka City East Clinic patients is $5500 per year. 

How is the Measure of Central Tendency'used in this instance? 



All mothers having delivery complications had less than the mean number of clinic visits 
during their first sue months of pregnancy. How is the Measure of Central Tendency being 

used in this example? 
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MEASURES OF CENTRAL TENDENCY 

USING THE MEASURES OF CENTRAL TENDENCY 



110 Both the mean and median can be used to represent all values of their distribution: both 
mean and median can be compared to means or medians of other distributions; and both 
mean and median can be used as a “normal” value against which individual values of their 
distribution can be compared. 

However, because it can also be used in further statistical computation and applications, 

the mean / median is the Measure of Central Tendency of choice. 



Ill Though it will not be covered further in this Lesson, an example of a more complex 
statistic requiring the use of the mean is the “standard deviation”-a commonly used 
measure of dispersion within a distribution. 



112 From memory recall the uses of the mean and median: 




MEASURES OF CENTRAL TENDENCY 
USING THE MEASURE OF CENTRAL TENDENCY 

113 Use the worktable below to answer the questions: 

WORKTABLE: Kypothetica! Distribution Of 74 Restaurants By Num- 
ber Of Inspections During One Year. 



Number of 


Number of 


Inspections 


Restaurants 


1-2 


6 


3-4 


12 


5-6 


22 


7-8 


19 


9-10 


11 


11-12 


4 


Total 


N * 74 



1 , Which Mcasure(s) of Central Tendency could be used on the above data: 



2. Which Mcasure(s) of Central Tendency should be used on the above data if not 
otherwise prohibited? 
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114 Use the worktable below to answer the questions: 



WO R KT A B LE : Distri bu tion 
January -October, 1962 


Of Paralytic Polio Cases, By Age, Texas, 


Age in 


Number of 


Years 


Cases 


0-4 


77 


5*9 


22 


10-14 


8 


15-19 


5 


20-29 


4 


30*39 


3 


40-49 


5 


Total 


N = 124 



1. Which Measure(s) of Central Tendency could be used on the above data? 



2. Which Measure(s) of Central Tendency should not be used in view of the 

characteristics of the data? 

Why? 
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USING THE MEASURES OF CENTRAL TENDENCY 



115 Use the worktable below to answer the questions: 





WORKTABLE: 


Distribution Of 185 Boys By Weight. 






Weight 
In Pounds 


Number of 
Boys 






60*64 


3 






65-69 


23 






70-74 


50 






75-79 


51 






80-84 


31 






85-89 


11 






90-94 


4 






95-99 


3 






* >100 


4 






Total 


N = 180 




* > is a symbol meaning "greater than or equal to." 



1. Which Measure(s) of Central Tendency could be used on the above data? 



2. Which Mcasurc(s) of Central Tendency could not be used on the above data? 
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A health administrator planning for peak periods of activity would be interested in 
which of the three types of Measures of Central Tendency? 

Why? 



An investigator wishes to compare income in a particular state with that of other 
states. How will his methods of reporting “average” income be affected by the way 

other states report their average income? 



An investigator in the incidence of VD will be doing exhaustive statistical analysis on 
his data. Which is the Measure of Central Tendency he will use? 

Why? 



\ 



From memory, recall the data used to compute the mean or the median: (Think your 
answer.) 
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MEASURES OF CENTRAL TENDENCY 
USING THE MEASURES OF CENTRAL TENDENCY 

From memory, recall how the mean and the median can be used: (Think your answer.) 



What arc the conditions for using the mean, median, or mode? (Think your answer.) 
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MEASURES OF CENTRAL TENDENCY 



RECOGNIZING CHARACTERISTICS OF THE DATA 

122 As we mentioned at the beginning of this Lesson, detailed techniques of computation are 
not covered. Rather, they are included in Guides which you will use whenever the need 
arises. However, you should learn how to recognize certain characteristics of the data that 
will affect the selection of the correct computational techniques. 
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MEASURES OF CENTRAL TENDENCY 



DATA WITH N OF < 50 or > 50 

123 The technique you will use to compute a Measure of Central Tendency will often depend 
on the Number of values involved. Identify the random lists of values below as having an 
N (count of individual values or total of frequencies) of . . . 

a. less than 50 (< 50) 

b. greater than or equal to 50 (> 50) 

1. A random list of values: 1, 1, 1, 1, 1, 1 ,2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 

3, 3, 3, 3, 3, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 5, 5, 5, 5, 5, 5, 5, 5,5,5, 5, 6, 6, 

6 , 6 . 

2. A random list ofvalues: 5, 5, 10, 10, 15, 15, 20, 20, 20, 25, 25, 25, 30, 30, 

30, 35, 35, 40, 40, 45, 45, 45, 45, 45, 50, 50, 50, 50, 55, 55, 55, 55, 55, 55, 
55, 60, 60, 60, 60, 60, 65, 65, 70, 75, 80. 

3. A random list ofvalues: 1, 2, 3, 3, 7, 1 1, 1 1 , 1 1, 18, 20, 24, 24, 29, 30, 34, 

37, 42, 43, 48, 50, 50, 55. 

4. 



WORKTABLE: A Random List of Values. 



Values 


Frequency of 
Value 




2 


6 




4 


18 




6 


24 




8 


10 




10 


8 




WORKTABLE: 


A Random List Of Values. 




Values 


Frequency of 
Value 




10-19 


14 




20-29 


23 




30-39 


20 




40*49 


15 




50-59 


4 
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In any list of values, the Number of values must not be confused with the extent to which 
different figures representing size or quantity occur. In random list 1. in Frame 1 23 the N 

of 53 values is represented by only different figures; in' worktable 4. the 

N of 66 values is represented by only different figures. 



The size of the figures representing the values of a list in no way indicates the N of values 
in that list. In list No. 2. of Frame 1 23, there is an N of < 50 values (45 to be exact) and 

yet the sizes of the values range up to (well above 50); in No. 4., the 

worktable, there is an N of > 50 (66 values) and yet the highest value is only 

(well below 50). 



The difference between the lowest value of a list and the highest in no way indicates the 
I V of the list. In list No. 3. of Frame 123, there is an N of only 22 values (< 50) and yet 

the range represented is above / below 50; in No. 4., the worktable, there is anN 

of 66 values C> 50) and yet the range represented is above / below 50. 



To find the N for values listed in worktables you need only add the figures listed in the 
frequency column. The fourth and fifth examples (Nos. 4. and 5.) in Frame 123 illustrate 

the fact that a worktable usually will have an N < 50 / N ^ 50 . 
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MEASURES OF CENTRAL TENDENCY 



DATA WITH DISCRETE OR CONTINUOUS VALUES 

128 The number of inspections made by 100 sanitarians during a particular week ranged from 

1 through 5 each. Therefore, each sanitarian made either : .» * * 

or inspections. 



129 For statistical purposes, a sanitarian either makes an inspection or he doesn t. A sanitarian 
cannot make .5 or 2.75 actual inspections. Therefore, the values representing the number 
of inspections made by each sanitarian are said to be . . . 

a. discrete values (indivisible units or counts) . 

b. continuous values (measurable as portions or fractions) . 



130 Tire humidity readings for 75 hospital nurseries ranged from .46 through .54 each. 
Therefore, can we say that each hospital had a humidity reading of either 
.46, .47, .48, .49, .50, .51, .52, .53, or .54? 

yes 

no 



131 All values are not discrete, i.e., indivisible units or counts. Sometimes values are units or 
counts that are at best very close approximations. For example, between humidity 
readings of .46 and .47 there may be actual readings of .461 , .462, .463, ... and .469; or 
between .461 and .462, there may be actual readings of .4611, .4612, .4613 ... and 
.4619. Therefore, the values representing humidity readings are said to be . . . 

a. discrete values (indivisible units or counts). 

b. continuous values (measurable as portions or fractions) depending on 
the accuracy of the gauge. 
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MEASURES OF CENTRAL TENDENCY 
DATA WITH DISCRETE OR CONTINUOUS VALUES 



132 DISCRETE values arc indivisible units or counts that either happen or do not happen. 
They arc usually counted not measured. 

CONTINUOUS values are divisible units or counts that are stated in that form (fractional 
or whole numbers) which can be most accurately approximated (measured) and most 
conveniently used. 

Identify the values named below as either . . . 

a. discrete 

b. continuous 

1. inspection 

2. millimeters of blood pressure 

3. age 

4. weight 

5. person 

6. height 

7. pregnancy 

8. illness 

9. innoculation 
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MEASURES OF CENTRAL TENDENCY 



DATA 'WITH VALUE RANGE OF > 14 or < 15 

133 The technique you will use to compute a Measure of Central Tendency will often depend 
on the difference (range) between the highest and lowest values of your list. Identify the 
random lists of values below as having a range of . . . 

a. greater than 14(>14) 

b. less than IS (< IS) 

1. A random list of discrete values: 1 , 1 , 1, 1 , 1 , 1, 2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 

3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 5, 5,5, 5 , 5 , 5 , 5 , 

5 , 5 , 5 , 5 , 6 , 6 , 6 , 6 . 

2. A random list of discrete values: S, S, S, S, 10, 10, 10, 10, IS, 15, 1 5, 20, 

20, 20, 20, 20, 25,25,25,25,25,25,30, 30,30,30,35,35,35,40,40, 
40,45,45,45,45,45,45,45,45, 45, 50, 50, 50,50,50,50,50,50,55, 
55, 55, 55, 55, 55, 60, 60, 60, 60, 60, 60, 65, 65, 65, 75, 75, 80, 90. 



.3. 

WORKTABLE: A Random List Of Discrete Values. 





Value 


Frequency of 
Value 




2 


6 




4 


IB 




6 


24 




8 


10 




10 


B 




Total 


66 


4. 




WORKTABLE: 


A Random List Of Discrete Values. 




Value 


Frequency of 
Value 




10*19 


14 




20*29 


23 




30*39 


20 




40*49 


15 




50*59 


4 




Total 


126 
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; . 134 If you are on this frame because the simplicity of the last one made you suspicious, let’s 

look at each example in turn to see how really simple the frame was. 

! v 1, The high value is 6, the low value 1 , the difference is and therefore, 

r > 14/ < is . 
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135 In the list: 

2. High value minus low value = ; >14/ <15 . 

3. minus = ; >14/ <15 . 

4. minus = ; > 14/ < IS . 



136 The four examples in Frame 133 support the fact that the question of > 14 or < 15 arises 
usually (if not always) when there is an N of < 50 / > 50 . 



137 Does further inspection of the four examples in Frame 133 support the fact that the list 
of values are in fact discrete or are treated as discrete values? 

yes 

no 
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138 Actually the purpose of the >14 or <15 determination is to estimate the approximate 
number of different value sizes (figures) the data contains. This is particularly use u 
when you have an extremely large N. 

However, you saw in our discussion of discrete and continuous values, that only with 
discrete values can such a determination be made. Therefore, we must employ certain 
unusual devices and know certain things about our values to make the >14 or <15 
determination work with continuous values. 

If we knew, for 75 humidity readings ranging from .46 through .54, that all were reported 
to the nearest hundredth, then how many possible readings arc there? 



139 You could have gotten the right answer to the last question by counting the values that 
could occur (in the hundredth) on your fingers; or you could have done as follows: 

1 . Ignoring the decimal places, 54 

subtract the lowest value - _46_ 

from the highest 8 

2. Add 1 + _L 

9 (which is how many different 
values may occur) 

What if you know (can observe) that many, if not most, of the 75 humidity readings arc 

sported to the nearest thousandth - would you then have > 14 or< 15 ? 

How many different values would be possible? 



140 The precision with which values are stated is usually obvious to you at the onset so that 
you would know in the last problem if the lowest reading should be stated as .46 or .460. 

If the latter then: 

540 (ignore decimal places) 

- 460 (ignore decimal places) 

80 (this is >14) 

+ 1 

8 1 (which is how many different values may occur) 
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MEASURES OF CENTRAL TENDENCY 
DATA WITH VALUE RANGE OF >14 or <15 



141 Check the appropriate descriptions below that best describe the data that follows: 

a. N<50 

b. N>50 

c. discrete values 

d. continuous values 

The following is a list of clinic visits made by each woman admitted to prenatal service in 
Walker County who delivered during 1960: 2, 5, 1, 3,2, 4, 5, 7, 3, 6, 1, 3, 4, 2, 5,4,3, 6. 
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142 Check the appropriate descriptions below that best describe the data that follows: 

a. N<50 

b. N>50 

c. discrete values 

d. continuous values 

e. range >14 

f. range <15 



WORKTABLE: Distribution Of Well-Child Clinics, By Number Of 
Children Attending, Jones County, Year Ending June 30, 1960. 



Number of 
Children 


Number of 
Clinics 




10-14 


6 




15-19 


9 




20-24 


11 




25-29 


8 




30-34 


14 




35-39 


8 




40-44 


3 




45-49 


1 
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MEASURES OF CENTRAL TENDENCY 
DATA WITH VALUE RANGE OF > 14 or < 15 

143 Check the appropriate descriptions below that best describe the data that follows. 

a. N<50 

b. N>50 

c. discrete values 

d. continuous values 

e, range >14 

f„ range <15 



WORKTABLE: Distribution Of 2-Year-Old Children Attending Well-Child 
Clinics, By Height In Inches, Jones County, Aprll-June, 1960. 





Height In 
Inches 


Number of 
Children 




32 


1 




33 


4 




34 


7 




35 


9 




36 


13 




37 


9 




38 


7 




39 


6 




40 


2 




41 


1 




42 


1 



i 
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144 Check the appropriate descriptions below that best describe the data that follows: 

a. N<50 

b. N>50 

c. discrete values 

d. continuous values 

e. range > 14 

f. range < 15 



Following is a list of weights to the nearest tenth of a pound at birth for live births 
occurring during 1960 to parent!; who are residents of Jones County: 3.4, 4.9, 



5.6, 


11.6, 


8.5, 


9.1, 


7.6, 


8.2, 


6.7, 


7.4, 


6.0, 


6.5, 


9.6, 


9.8, 


10.0, 


7.5, 


8.3, 


7.7, 


8.1, 


7.6, 


8.2, 


7.9, 


8.0, 


6.8, 


7.4, 


6-9, 


7.2, 


5.0, 


5.9, 


6.2, 


10.9, 


9.7, 


8.4. 


9.2, 


8.8, 


8.0, 


7.8, 


8.2, 


7.6, 


7.5, 


9.2, 


6.6, 


7.4, 


7.1, 


8.3, 


8.1, 


7.5, 


7.7, 


8.2, 


9.1, 


8.5, 


4.9, 


6.3, 


5.9, 


7.8, 


8.1, 


7.9, 


8.0, 


7.6, 


6.8, 


7.2, 


10.5 


, 9.4 


, 8.7, 


, 9.2, 


, 6.8, 


7.0, 


7.2, 


6.3, 


5.9. 
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145 Check the appropriate descriptions below that best describe the data that follows. 

a. N <50 

b. N >50 

c. discrete values 

d. continuous values 

e. range >14 

i 

f. range <15 



WORKTABLE: Distribution Of Women Admitted To Prenatal Service, By 

Age In Years, Jones County, I960. 



Age in 
Years 


Number of 
Women 


15-19 


15 


20*24 


25 


25*29 


10 


30-34 


6 


3539 


5 


40*44 


3 


45*49 


1 
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RESULTS OF FIELD DEMONSTRATIONS 



; H 

I f 

Field demonstrations of Measures of Central Tendency were held at the Center for Disease Control, . 

Atlanta, Ga., and at the Los Angeles County Health Department, Los Angeles, Calif, Measures of 
Central Tendency is the prerequisite Lesson for the three-part course on Descriptive Statistics for the 
Health Professions, Other parts of the course are the Guide : Median and Guide : Arithmetic Mean, 

Sonic 33 students at CDC took the pretest in a supervised group. Each student was then given a copy 
of the Lesson to complete on a take-home basis. The students met together a week later to take the 

post test. ! 

The 61 Los Angeles students worked in a formal classroom setting for three half-day sessions. A total 

of 4 hours classroom time was allotted each student to work on the Lesson after taking the pretest j 

under supervision. If necessary, each student* was allowed extra time outside class to complete the 

Lesson . A posttest was administered when all of the students had completed the Lesson . j 

i 

There were specific differences between the two groups. Students at CDC had voluntarily parti- • 

cipatcd, while the Los Angeles participants had been requested to attend the course. Sixty percent of | 

each group had college degrees. But 33% of the CDC group had post-graduate degrees. In comparison, j 

8% of the Los Angeles students had post-graduate degrees. ; 



RESULTS 
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Pretest Posttest 



CDC. range = 0% - 48% 


range = 63% - 1 00% 


median = 1 5% 


median = 92% 


Los Angeles range = 5% -45% 


range = 42% - 1 00% 


median = 18% 


median = 85% 
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